Availability Requirement for a Fault Management Server in High-Availability Communication Systems
Abstract
In this paper, we investigate the availability requirement for the fault management server in high-availability communication systems. Our study finds that the fault management server does not itself need 99.999% availability to guarantee 99.999% system availability, as long as two ratios are sufficiently high: the fail-safe ratio (the probability that a failure of the fault management server will not bring down the system) and the fault coverage ratio (the probability that a failure in the system is detected and recovered by the fault management server). The availability of the fault management server, the fail-safe ratio, and the fault coverage ratio can be traded off against one another to optimize system availability. A cost-effective design for the fault management server is proposed in this paper.
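The tradeoff described above can be illustrated with a small numerical sketch. The model below is a hypothetical first-order approximation written for this illustration; it is not the model used in the paper. The function names, the parameters `faults_per_year`, `auto_recovery_min`, and `manual_recovery_min`, and the example values are all assumptions. The first term assumes that a non-fail-safe failure of the fault management server keeps the system down for as long as that server is down; the second assumes that a system fault is recovered quickly only when the server is up and the fault is covered.

```python
MINUTES_PER_YEAR = 365.25 * 24 * 60


def downtime_minutes_per_year(availability):
    """Convert an availability figure into expected downtime per year."""
    return (1.0 - availability) * MINUTES_PER_YEAR


def system_unavailability(fm_availability, fail_safe_ratio, coverage_ratio,
                          faults_per_year, auto_recovery_min,
                          manual_recovery_min):
    """Hypothetical first-order estimate of system unavailability.

    This is an illustrative assumption, not the paper's model.
    """
    # Contribution of fault management (FM) server failures that are
    # not fail-safe and therefore take the system down with them.
    u_fm = (1.0 - fm_availability) * (1.0 - fail_safe_ratio)

    # A system fault is recovered automatically only if the FM server
    # is up and the fault falls within its coverage.
    p_auto = fm_availability * coverage_ratio
    mean_outage_min = (p_auto * auto_recovery_min
                       + (1.0 - p_auto) * manual_recovery_min)
    u_faults = faults_per_year * mean_outage_min / MINUTES_PER_YEAR

    return u_fm + u_faults


if __name__ == "__main__":
    # Downtime budget implied by a 99.999% availability target.
    print(downtime_minutes_per_year(0.99999))

    # Example values are assumptions chosen only to show the tradeoff.
    for fail_safe in (0.999, 0.9):
        u = system_unavailability(
            fm_availability=0.999, fail_safe_ratio=fail_safe,
            coverage_ratio=0.999, faults_per_year=2,
            auto_recovery_min=0.1, manual_recovery_min=60.0)
        print(fail_safe, u, u < 1e-5)
```

Under these assumed numbers, a fault management server with only 99.9% availability stays within the 99.999% system budget when the fail-safe ratio is 0.999, but exceeds it when the fail-safe ratio drops to 0.9.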
Related articles
Trends in Data Management: 'High Availability'
In today’s highly competitive business environment, for most companies it is imperative that their business data is always accessible from their data servers in a seamlessly fault tolerant manner. DBMSs, which constitute the core of most business information systems, are attempting to implement this requirement through a feature often referred to in the data management literature as ‘high avail...
Lightweight Fault-tolerance for Highly Cooperative Distributed Applications
The recent introduction of high-speed networks, faster processors, and the rapid growth of heterogeneous large-scale distributed systems has enabled the development of distributed applications that move beyond the client-server model to truly harness the computational potential of distributed systems. These new applications will be structured around groups of agents that communicate using messa...
Availability Analysis of Blade Server Systems
The successful development and marketing of commercial high-availability systems requires the ability to evaluate the availability of systems. Specifically, one should be able to demonstrate tha...
New Paradigms in Check-pointing Techniques in Distributed Mobile Systems
Distributed systems today are ubiquitous and enable many applications, including client–server systems, transaction processing, the World Wide Web, and scientific computing, among many others. Distributed systems are not fault-tolerant and the vast computing potential of these systems is often hampered by their susceptibility to failures. Many techniques have been developed to add reliability a...
Configurable Highly Available Distributed Services
A service provided by a computing system is characterised as fault-tolerant [4] when it continues to be provided according to its specifications despite failures of system components (software or hardware) that participate in the service provision. With the ever increasing introduction of computing systems in many aspects of today's life, fault-tolerance of critical services becomes of great imp...
Journal:
- IEEE Trans. Reliability
Volume 52
Publication date: 2002